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Abstract 

The importance of studying properties of networks is manifest in numerous fields ranging from biology, physics, 
chemistry, medicine, neuroscience, and engineering. The functionality of networks with regard to performance, 
throughput, reliability and robustness is strongly linked to the underlying geometric and topological properties 
of the given network, which is the focus of this paper, especially as applied to certain biological networks. The 
fundamental mathematical abstraction of a network as a weighted graph brings to bear the tools of graph theory-a 
highly developed subject of mathematical research. But more importantly, recently proposed geometric notions of 
curvature on very general metric measure spaces allow us to utilize a whole new set of tools and ideas that help 
quantify functionality and robustness of networks modelled as graphs. In particular, robustness is closely connected 
to network entropy which, in turn, is very closely related to curvature. There are a number of alternative notions 
of discrete curvature that are compatible with the classical Riemannian definition, each having its own advantages 
and disadvantages, and are relevant to networks of interest. We will concentrate on the role of curvature for certain 
key cancer networks in order to quantitatively indicate their apparent functional robustness relative to their normal 
counterparts. 


1. Introduction 


The growing importance of studying complex networks has been documented in a huge and growing 
literature, and has even been referred to as the field of network science [|4|. We are interested in the 
deep and intimate relationship between network functionality in terms of robustness, on the one hand, 
and topological and geometric properties such as curvature on the other. Following [fTTlI . we will give a 
formal definition of “robustness” in terms of the rate function from large deviations theory. This definition 
captures the essential property of complex networks of being able to adapt to dynamic changes (modelled 
as random perturbations), while maintaining functionality O. One of the key ideas is based on the 
positive correlation between an increase of curvature and network functional robustness, as we will argue 
in Section |II-C[ As we will explain, the necessary geometric notions are quite easy to compute in terms 
of the graph Laplacian or the Earth Mover’s Distance and, thus, one can assess and quantify robustness 
of networks quite efficiently. This relationship is particularly compelling because, on one hand, curvature 
is an equilibrium property of the system (as is the entropy) while, on the other, robustness is a dynamic 
(non-equilibrium) property. Nevertheless this correlative relationship holds. 


We will also see that network curvature is closely related to network entropy. In fact, one of the ways of 
generalizing curvature to rather general metric measure spaces is to exploit the convexity properties of 
the entropy along geodesic paths defined on the associated space of probability measures with an induced 
Riemannian structure [Q, [EH, 1351. In the evolutionary biology literature, it has been noted that entropy 
is closely related to network topology; see [ITlIl . In the latter work, the authors argue (based on Darwinian 
principles) that entropy is a selective criterion that may account for the robustness and heterogeneity of 
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both man-made and biological networks. As we have just mentioned, in this present work, we will argue 
that curvature from network geometry is also strongly related to functional robustness. 

Much of the work in this paper is motivated by the problem of seeking basic principles and commonalities 
across many types of complex networks in addition to taking the data-driven point of view [[0. Indeed, 
search engines and cell biology have a number of things in common; they need to process incomplete and 
noisy data and rely on input that is stochastic in nature. Search engines rely on empirical distributions 
that are based on very limited finite sampling, often of poorly understood features. Cell biological cycles 
are characterized by empirical distributions that are driven by poorly understood environmental cues 
and intracellular checkpoints. This motivates problems that include the characterization of robustness, 
reliability, and possible uncertainty principles as they pertain to information transfer across networks. 

In this note, the central object of study will be cancer networks, and the possibility of using graph curvature 
information to identify key hubs and possible targets of interest in a quantitative manner. Indeed, for 
biological networks in connection with various types of cancer, it appears that entropy is a key network 
property in revealing “cancer hallmarks” ll59ll : thus, generalizations of curvature that are based on entropy 
[|^ would seem most appropriate in this context. We will include some background material to make 
our presentation relatively self-contained. Thus this note will have a tutorial flavor as well. The present 
work is an extended version of [[50l . 


II. Background on curvature 

Various networks may be regarded as geodesic metric spaces, say via the hop metric for communication 
networks [|28l . Since the object of interest for a cancer network will be a weighted graph (see Section [TV]), 
we will consider notions of curvature that best fit this mathematical model, and can lead to interesting 
new quantitative biological insights. Accordingly, we will first sketch some material on curvature, before 
moving on to the proposed notions for networks modelled as graphs. 


A. Wasserstein distance 


We begin by recording the basic definition of the L^-Wasserstein distance from optimal transport theory 
that we will need below. For full details about the Monge-Kantorovich (optimal mass transport) problem 
and the associated Wasserstein distance, we refer the interested reader to [|T3ll . [I42ll . [[55ll . [[56]| . Let X be 
a metric measure space, equipped with distance d. Let i = 1, 2, be two measures with the same total 
mass and finite p-th moment. A coupling between pi and p 2 is a measure p on X x X such that 

/ dp{x,y) = dpi{x), / dp{x,y) = dp 2 {y). 

J y J X 

In other words, the marginals of p are pi and p 2 . Let Yi{pi^p 2 ) be the set of couplings between pi and 
d2- 


We define the Wasserstein distance as 


Wp{pi,P2) 


{ inf 


d{x,yydp{x,y) 


i/p 


In this paper, we will only consider the cases p = 1, 2. For p = 1, the Wasserstein distance is sometimes 
called the “Kantorovich-Rubinstein distance” or the Earth Mover’s distance (EMD). 
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B. Generalities on Ricci curvature 

In order to motivate generalized notions of Rieei eurvature suitable for eomplex networks, we will begin 
with an elementary treatment of eurvature following IIT^ . [l55ll . [|5^ . For M. an n-dimensional Riemannian 
manifold, x G M., let T^M. denote the tangent spaee at x, and mi,M 2 G T^M. orthonormal veetors. Then 
for geodesies 7 i(f) := exp(fMj), z = 1,2, the sectional curvature K{ui,U 2 ) measures the deviation of 
geodesies relative to Euelidean geometry, i.e., 

<i(7i(«).72(i)) = '/2t(l - + 0(t% (1) 

The Rieei eurvature is the average seetional eurvature. Namely, given a (unit) veetor u G T^M, we 
eomplete it to an orthonormal basis, u,U 2 , ■ ■ ■ ,Un- Then the Ricci curvature is defined by Ric{u) : = 
X]r= 2 (There are several different sealing faetors used in the literature. We have followed 
[fT^ .f It may be extended to a quadratie form, giving the so-ealled Ricci curvature tensor. Rieei eurvature 
is strongly related to the Laplaee-Beltrami operator. Indeed, in geodesie normal eoordinates we have that 

Rij = —3/2A.gij, 

where gij denotes the metrie tensor on Ai. 


J(t) 



Fig. 1: Explanation of eurvature in terms of Jaeobian. 


There is another way to understand eurvature: it gives a related measure of the spreading of geodesies in 
the following manner ifT^ . Let 7 denote a geodesie and 7^ a smooth one parameter family of geodesies 
with 7o = 7 . Then a Jacobi field may be defined as 


ds 

It may be regarded as an infinitesimal deformation of the given geodesie. Then it is standard that J{t) 
(essentially the Jaeobian of the exponential map) satisfies the Jaeobi equation: 

^J(f) + i((J(f),7(f))7(f) = 0, 

where £ denotes eovariant derivative, and R is the Riemann eurvature tensor; see Figure 


( 2 ) 
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We want extend these notions to discrete graphs and networks. For discrete spaces corresponding to 
networks modeled as graphs, ordinary notions such as differentiability needed to define Ricci curvature 
as in the previous section do not make sense. There is however a very nice argument due to Villani [|56ll 
that indicates a possible way to getting around such difficulties via two approaches to convexity. More 
precisely, let / : M” —)■ M. Then if / is convexity may be characterized as 

vV(x) > 0 

for all X. Villani calls this an analytic definition of convexity (as the usual definition of Ricci given above). 
On the other hand, one can also define convexity in a synthetic manner via the property that 

/((I - t)x + ty) < (1 - t)f{x) + tf{y), 

for all x,y G and t G [0,1]. In the latter case, no differentiability is necessary. 

Following ll^ . one may define a synthetic notion of Ricci curvature in terms of so-called displace¬ 
ment convexity inherited from the Wasserstein geometry on probability measures. In ll^ . this is done 
for measured length spaces, that is, metric measure spaces in which the distance between two points 
equals the infimum of the lengths of curves joining the points. For discrete spaces (such as those arising 
in network theory), there are several possibilities that we will compare, especially those from [|3, [|T^ . 
llTbll . There have been several other approaches as well to defining Ricci curvature; see f[3^ . [[^ and the 
references therein. 

To conclude, let us just give one elegant result [l55l . Assume that Ad is a Riemannian manifold, and let 
P(A4) denote the space of probability measures on A4 equipped with the Wasserstein 2-metric. Let pt 
denote a geodesic curve on V{M.). Then Ric > 0 if and only if the curve t —)■ f ptlogpt is convex (in 
t). Thus lower bounds on the Ricci curvature are closely connected to entropy and geodesics on spaces 
of probability measures. 


C. Curvature and robustness 

As previously remarked, there have been a number of approaches (see BTl, 041 . [[81 . O^ and the references 
therein) to extending the notion of Ricci curvature to more general metric measure spaces. At this point, 
the exact relationship of one approach as compared to another is unclear. Roughly, the techniques fall into 
two categories: the first generalizing the weak /c-convexity of the entropy functional on the Wasserstein 
space of probability measures as in O^ . 051 . llTl . and the second directly working with Markov chains 
to define the generalization on networks as in O^ . 04l . [f8l . Finally there is a notion of “hyperbolicity” 
due to Gromov llT9l based on the “thinness” or “fatness” of triangles compared to the Euclidean case, and 
more generally a certain four-point criterion. Depending upon the application, each approach seems to be 
useful, and so we will explicate key aspects and their possible applications to various network problems. 
In particular, we describe in some detail the formulations of curvature in [[32l, [|36l, [|T71 because of 
connections to notions of metric entropy. 

We begin with a characterization given in Lott and Villani [[32l. Let {X,d,m) denote a geodesic space, 
and set 

V{X,d,m) := {/i > 0 : / pdm = l}, (3) 

Jx 

V*{X,d,m) := {p & V{X,d,m) : lira / plogpdm<oo}. (4) 

"AO 


//(fi) 


lim / 

"AO J ^>e 


We define 


p log p dm, for p E V* (X, d, m), 


(5) 
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which is the negative of the Boltzmann entropy S{p) := note that the concavity of S is equivalent 

to the eonvexity of H. Then we say that X has Ricci curvature bounded from below by k if for every 
/io,/xi G V{X), there exists a eonstant speed geodesie pt with respeet to the Wasserstein 2-metrie 
eonneeting and pi sueh that 

S{pt) > tS{po) + (1 - t)S{pi) + 0 < f < 1. (6) 


This indieates the positive correlation of entropy and eurvature that we will express as 

A5 X XRic > 0. (7) 


We will deseribe appropriate notions of Rieei eurvature and entropy on graphs below. We just note here 
that ehanges in robustness, i.e., the ability of a system to funetionally adapt to ehanges in the environment 
(denoted as XR) is also positively correlated with entropy via the Fluetuation Theorem liTOl . Il9l . and thus 
with network curvature: 

XR X XRic > 0. (8) 


Sinee the eurvature is very easy to eompute for a network, this may be used as an alternative way of 
expressing funetional robustness. 


Based on the work of m, El, it seems that many normal eellular protein interaetion networks have 
lower entropy than their caneerous analogues. Funetionally this makes sense: since oncoproteins are many 
times disordered (have a larger number of degrees of freedom), they are able to respond better to ehanges 
in the eellular environment. The notion of curvature now allows to precisely quantify this adaptive 
ability. In Section IV we will apply eurvature to eertain eaneer networks to test the idea robustness 
expressed in a geometrie manner. 


D. Remarks on the Fluctuation Theorem 

The form of the Fluctuation Theorem that we use was formulated in tfTOl . Il9l . The idea is very simple and 
eomes from the theory of large deviations [l53l . One considers random fluetuations of a given network 
that result in perturbations of some observable. We let Pe{t) denote the probability that the mean deviates 
by more than e from the original (unperturbed) value at time t. Sinee pg(f) —)■ 0 as f —)■ cxd, we want to 
measure its relative rate, that is, we set 

R := lim(-y logp,(f)). 

t—>-oo t 

Therefore, large R means not much deviation and small R large deviations. In thermodynamies it is 
well-known that entropy and rate funetions from large deviations are very closely related. The Fluetuation 
Theorem is an expression of this fact for networks, and may be expressed as 

XSxXR> 0. (9) 

The Fluctuation Theorem has consequences for just about any type of network: biological, communication, 
social, or neural. In rough terms, it means that the ability of a network to maintain its functionality in 
the faee of perturbations (internal or external), ean be quantified by the eorrelation of aetivities of various 
elements that eomprise the network. In the standard statement, this eorrelation is given via entropy. We 
have formulated this now in terms of eurvature. 
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III. Ollivier-Ricci curvature 

For the weighted graphs used to model the eaneer networks of interest in this paper, we will employ a neat 
notion of Rieei eurvature due to ll36l . [|37]| . The approaeh was inspired from several different direetions 
from properties of Rieei eurvature in the eontinuous ease. Referring to Figure the idea is that for two 
very elose points x and y with tangent veetors w and w', in whieh w' is obtained by a parallel transport 
of w, the two geodesies will get eloser if the eurvature is positive. This is refleeted in the faet that the 
distanee between two small (geodesie balls) is less than the distanee of their eenters. Rieei eurvature along 
direetion xy refleets this, averaged on all direetions w at x. Similar eonsiderations apply to negative and 
zero eurvature [l57l . 


d(x,y)(1- C^Ric/2M) on average 



Fig. 2: Positive eurvature is deteeted when distanee from one ball to another is smaller than those of their 
eenters on the average. 


More formally, we define for (X, d) a metrie spaee equipped with a family of probability measures 
{fix : X G X}, the Olliver-Ricci curvature n{x,y) along the geodesie eonneeting x and y via 

= (1 - K(x,2/))d(x,2/), (10) 


where, as noted earlier, Wi denotes the Earth Mover’s Distanee (Wasserstein 1-metrie), and d the geodesie 
distanee on the graph. For the ease of weighted graphs of greatest interest in networks, we put 


dx 


'^xy 

y 

'Wxy 


the sum taken over all neighbors of x where Wxy denotes the weight of an edge eonneeting x and y (it 
is taken as zero if there is no eonneeting edge between x and y). The measure fix may be regarded as 
the distribution of a one-step random walk starting from x. As is argued in [f3^ . this definition is more 
inspired from an approaeh sueh as that given via equation Q. An advantage of this, is that it is readily 
eomputable sinee the Earth Mover’s Metrie may be eomputed via linear programming [l56ll . 


Note that if we define the Laplaeian operator via 

^f{x) = fix) — f (y)yxiy), f real-valued funetion, 

y 

this eoineides with the usual normalized graph Laplaeian operator It is also interesting to note in 
this eonneetion that if k < K,ix,y) is a lower bound for the Rieei eurvature, then the eigenvalues of A 
may be bounded as A; < A 2 < ... Aat < 2 — fc; see [|26l for the exaet statement. (The first eigenvalue 
Ai = 0.) This relationship is very important sinee 2 — Atv measures the deviation of the graph from being 
bipartite, that is a graph whose vertiees ean be divided into two disjoint sets U and V sueh that every 
edge eonneets a vertex in U to one in V. Sueh ideas appear in resouree alloeation in eertain networks. 
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Lastly, while this work demonstrates the usage of eurvature to quantify robustness of networks (and in 
partieular eaneer networks), altering the network properties to uneover potential drug targets provides an 
exeiting avenue. With this in mind, one may eonsider the eorresponding Rieei flow: 

y) = -k{x, y)d{x, y). (11) 

Not mueh is known about this flow, but the idea would be, while keeping the same topology, to ehange the 
graph weights, or the network of links among the nodes, in sueh a way as to uniformize the eurvature k. 
As argued in [15^ . this eould have the effeet (in case of certain wireless networks) of removing some of the 
overloaded queues, and thus may have important implications for cancer networks as well. Understanding 
discrete analogues of Ricci flow in this connection will be considered as a future research topic in this 
connection. Finally, we should note that the Ollivier-Ricci curvature gives the value of the ordinary Ricci 
curvature on a Riemannian manifold after scaling. 


A. Examples of fluctuation and curvature 


Mean-Reverting Process: It is very informative to consider the relationship of the Ollivier-Ricci curvature 
and robustness via a simple example ll^ . lITTl . Consider the Omstein-Uhlenbeck process 

dXt = —aXtdt + adWti -A(0) = Xq, (12) 


where W is Brownian motion (Wiener process), and we take xq to be deterministic. We treat the 1- 
dimensional case for simplicity. Everything goes through in higher dimensions as well. The corresponding 
Fokker-Planck equation is 


dp 

~dt 


dxp 

~ Cr — -f 


a‘ 


d'^p 


dx 2 dx'^’’ 


(13) 


where p = p{x, t|xo, 0) is the transition probability of the underlying Markov process. One may show that 
p{xfl\xo,0) is a Gaussian process with mean and variance given by [HHl: 


{X{t))=xoe varX(f) 


cr 




We see that we get transition probabilities of mean xqC and variance independent of Xq. Since all the 
transitions p{x,t\xo,0) have the same variance (and are Gaussian) the 1-Wasserstein distance [fT^ 

Wi{p{xfl\xo,0),p{x,6t\xi,0)) = |a:oe"“* - 


Finally, 


n{xo,Xi) 


Wi{p{x, t\xo, 0),p{x, t\xi, 0)) 
|a;o - xi\ 


1 - e-“*. 


(14) 


Equation ( fH] ) illustrates the connection of fluctuation in a very simple explicit manner. Larger a corre¬ 
sponds to larger curvature k and this corresponds to how quickly the systems returns to equilibrium, that 
is to the mean going to 0. This can be seen in Figure for two given processes with differing a. 

Random vs Biological Graphs: The second example involves the computation of the Ollivier-Ricci 
curvature for a random graph as compared to that of a widely available biological network (transcription 
network of E. Coli taken from [|3l). Figure ^ shows the results for this computation of both networks. 
It has already been argued by Uri Alon [[21 that this transcription network exhibits a great degree of 
robustness and this is shown here. We also note that the random network has average Ricci curvature of 
about —0.352085 with 63% of the edges being negative, while for E. Coli we have average curvature of 
—0.0222 with 50% of the edges being negative. 







(a) (b) 

Fig. 3: (a) OU process with parameters: a = 1.0, a= 1, x(0) = 1. Ollivier-Rieei Curvature Computed 
at t = 1: K{x,y) = 0.6321 (b) OU proeess with parameters: a = 0.1, a= 1, a;(0) = 1. Ollivier-Rieei 
Curvature Computed at t = 1: K{x,y) = 0.0952. 




(a) (b) 

Fig. 4: (a) Olliver-Rieci curvature of random graph: darker colors represent more negative curvature (b) 
Olliver-Rieei eurvature of transeription network of E. Coli: darker colors represent more negative curvature. 


B. Scalar curvature 


Until now, we have eonsidered Rieei eurvature (in the Ollivier sense), which is defined between any 
two vertiees on a graph. One may eonsider other forms of eurvature sueh as sealar eurvature to further 
illuminate certain geometric network properties. Specifically, scalar curvature represents the amount by 
which the volume of a geodesic ball in a curved Riemannian manifold deviates from that of the standard 
ball in Euclidean spaee ifT^ . Given our previous definition and notation for Olliver-Ricci curvature, the 
sealar eurvature on a graph may be defined to be 

S{.x) ■.= ^n{x,y). (15) 

y 


Note that the sealar eurvature on a Riemannian manifold is essentially the eontraetion of the Rieei eurvature 
tensor with respeet to the metrie [fT2ll (up to sealing faetor). If one only takes adjaeent vertiees with respeet 


to the hop metrie, (15) is eompatible with the usual eontraetion. One may also eontraet with respeet to 
the family of probability measures, that is, we take the sealar curvature to be defined as 




(16) 


In praetiee, both definitions lead to similar results on the networks tested, and we will eompare them 
below. 
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Both S{x) and S{x) will be seen to provide useful “nodal measures” on the set of vertices in the graph, 
which will become important in comparing previously used entropy measures for biological networks. 
For example, the authors in [[591 define a notion of entropy (just using direct paths over a graph) as 


Six) 



^f^xiy) log/ix(i/) 
y 


(17) 


and where dx is the unweighted degree of node x. See [l59l for more general definitions when the vertices 
need not be adjacent. 


We will now show in Section IV that all forms of curvature presented here indicate a particular “hallmark” 
in cancer networks studied herein. Finally, we note whereas the Ollivier-Ricci curvature maybe useful in 
deriving a flow to weaken (or strengthen) a network’s robustness, scalar curvature may be useful for 
identifying nodes (genes) as potential targets and examining the network in a more global fashion. For a 
nodal measure, it seems that S and S give comparable results, but the curvature based measure S has the 
numerical advantage of not involving a logarithm that is discontinuous at 0. The Ricci curvature gives a 
new way of looking at nodal interactions. 


IV. Robustness of Cancer Networks 

Of greatest interest to us is the study of cancer networks and the relation of curvature. In particular, 
we present preliminary results demonstrating that cancer networks exhibit a greater degree of functional 
robustness with respect to normal tissue networks (as argued by [|59l ) on both metabolic and transcriptional 
networks. We further show that this “hallmark,” while significant at the macroscopic network scale, can 
be utilized to illuminate specific gene-to-gene pairs contributing to the networks robustness (or fragility) 
with possible associations regarding the onset and proliferation of cancer [|5T][ . [fT7][ . Lastly, we note 

that quantification by curvature is different than previous quantification methods such as differential gene 
expression. In what follows, we present a description of the preliminary data and corresponding tabulated 
results. 


A. Brief Description of Data Sets 

Our preliminary data (provided by Memorial Sloan Kettering) consists of metabolic and transcriptional 
regulation networks. In studying metabolic networks, we are provided eight different cases for which 
correlation values of gene-to-gene expression in cancer and normal networks are given. With respect to 
transcription networks, we are provided two cases of gene-to-gene correlation data sets associated with 
breast cancer and kidney renal clear cell carcinoma. The network is constructed using these correlation 
values as weights of the graph and the topology (adjacency matrix) of graph is given by underlying 
biological gene-to-gene interactions. 


B. Metabolic Networks 


The metabolic network data is similar to data utilized in [j^ with an investigation of an “arbitrary” set 
of 800 genes and their interactions. An important note regarding this metabolic data is that certain key 
genes associated with cancer (e.g., BRCAl, TP53) are not present. 


B.l Metabolic Networks: Curvature as a Measure for Robustness 

Given the above, we first present the difference (cancer-normal) in average curvature of each metabolic 
network. This is summarized in Tables |I[ |I^ and III for scalar curvature, Ricci curvature, and weighted 
scalar curvature, respectively. We note that the computation here is taken over just direct paths (for 
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necessary comparison to entropy) and that a complete computation (over all possible paths) still yields 
comparable results presented below. Indeed, since a graph is a 1-geodesic space, if k{x, y) > k on adjacent 
vertices, then this will be true as well for non-adjacent vertices [[JTlI . Further, we note that while the 
average curvature is not the optimal statistic, it nevertheless illustrates that the cancer network as a whole 
exhibits greater robustness (in terms of curvature) than its corresponding normal tissue network. 


Cancer Type 

A Scalar Curvature 

Cancer Tissue 

Normal Tissue 

Breast Cancer 

0.4284 

1.9649 

1.5185 

Head / Neck Squamous Cell Carcinoma 

0.3928 

1.9610 

1.5232 

Kidney Renal Clear Cell Carcinoma 

0.4627 

1.9724 

1.4647 

Kidney Papillary Carcinoma 

0.5445 

1.8425 

1.2980 

Liver Cancer 

0.1088 

1.8405 

1.7318 

Lung Adenocarcinoma 

0.2059 

2.0563 

1.8504 

Prostate Cancer 

0.2896 

1.8970 

1.6074 

Thyroid Cancer 

0.0829 

1.6937 

1.6108 


TABLE I: Tabulated results for all eight networks with respect to average scalar curvature. Cancer can be 
seen to more robust than its complementary normal network in each of the eight cases. 


One can see that scalar curvature provides a strong and clear indicator regarding cancer robustness 
compared to that of healthy tissue. This being said, in the next two sections, we provide a list of the top 
twenty gene-to-gene pairs for two of the above cases (given the scope of discussion) to further highlight 
the proposed cancer “hallmark.” Note, unlike that of previous approaches utilizing entropy [l59l . we are 
able to uncover important pairwise information through Ollivier-Ricci curvature. 


Cancer Type 

A Ricci Curvature 

Cancer Tissue 

Normal Tissue 

Breast Cancer 

0.0235 

0.1071 

0.0836 

Head/Neck Squamous Cell Carcinoma 

0.0217 

0.1058 

0.0841 

Kidney Renal Clear Cell Carcinoma 

0.0253 

0.1054 

0.0801 

Kidney Papillary Carcinoma 

0.0300 

0.1020 

0.0719 

Liver Cancer 

0.0060 

0.1011 

0.0951 

Lung Adenocarcinoma 

0.0114 

0.1137 

0.1023 

Prostate Cancer 

0.0160 

0.1049 

0.0889 

Thyroid Cancer 

0.0046 

0.0940 

0.0894 


TABLE II: Tabulated results for all eight networks with respect to average Ricci curvature. Cancer can 
be seen to more robust than its complementary normal network in each of the eight cases. 


Cancer Type 

A Weighted Curvature 

Cancer Tissue 

Normal Tissue 

Breast Cancer 

0.0139 

0.1496 

0.1358 

Head / Neck Squamous Cell Carcinoma 

0.0144 

0.1505 

0.1361 

Kidney Renal Clear Cell Carcinoma 

0.0105 

0.1515 

0.1410 

Kidney Papillary Carcinoma 

0.0122 

0.1501 

0.1379 

Liver Cancer 

0.0054 

0.1527 

0.1473 

Lung Adenocarcinoma 

0.0087 

0.1587 

0.1500 

Prostate Cancer 

0.0115 

0.1495 

0.1380 

Thyroid Cancer 

0.0069 

0.1430 

0.1361 


TABLE III: Tabulated results for all eight networks with respect to average weighted scalar curvature. 
Cancer can be seen to more robust than its complementary normal network in each of the eight cases. 


B.2 Metabolic Networks: Top 20 Gene Pairs of Breast Cancer 

In this section, we present the top twenty gene-to-gene pairs (as sorted by difference in Ollivier-Ricci 
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Gene 

Rank 

A Ricci Curvature 
(Cancer-Normal) 

Gene X 
(Symbol) 

Gene Y 
(Symbol) 

1 

0.4642 

ACAA2 

MY05B 

2 

0.4170 

LPO 

SOD3 

3 

0.4162 

BPNTl 

GLYCTK 

4 

0.4067 

GLYCTK 

GAL3ST1 

5 

0.4057 

PPAP2A 

PPAP2C 

6 

0.3916 

ACOl 

OAT 

7 

0.3881 

HADHB 

MY05B 

8 

0.3805 

GUCY2D 

NPR2 

9 

0.3687 

GUCY2D 

NPRl 

10 

0.3624 

GOT2 

AC02 

11 

0.3605 

GUCY1A2 

GUCY2D 

12 

0.3602 

GOT2 

ACOl 

13 

0.3600 

SGMSl 

LRAT 

14 

0.3598 

SLC25A22 

SLC25A18 

15 

0.3584 

AQP9 

AGMAT 

16 

0.3520 

ETFDH 

PYCRl 

17 

0.3518 

OAT 

AC02 

18 

0.3449 

GPX3 

LPO 

19 

0.3373 

ALDH9A1 

ALDH7A1 

20 

0.3317 

PYGB 

PYGL 


TABLE IV: The top 20 gene pairs in the 
metabolie network for breast tissue ease sorted 
with respect to Ricci curvature. 


Gene 

Rank 

A Ricci Curvature 
(Cancer-Normal) 

Gene X 
(Symbol) 

Gene Y 
(Symbol) 

1 

0.3418 

PGDS 

SLC02A1 

2 

0.2807 

NEU4 

CMAS 

3 

0.2364 

HGD 

HPD 

4 

0.2322 

SLC26A2 

UROD 

5 

0.2287 

SLC26A3 

UROD 

6 

0.2201 

PRDX3 

CYP24A1 

7 

0.2177 

SLC26A1 

UROD 

8 

0.2085 

PPAP2A 

DGATl 

9 

0.2011 

GLYCTK 

GAL3ST1 

10 

0.1976 

SLC26A11 

UROD 

11 

0.1967 

SGMSl 

DGATl 

12 

0.1924 

SLC26A8 

UROD 

13 

0.1860 

NEU4 

SLC17A5 

14 

0.1849 

ACADS 

ACAD 11 

15 

0.1826 

SLC26A6 

UROD 

16 

0.1817 

CHIA 

RENBP 

17 

0.1806 

SLC26A9 

UROD 

18 

0.1762 

CPTIB 

PIGW 

19 

0.1737 

CPTIA 

PIGW 

20 

0.1727 

HYAL3 

LPGATl 


TABLE V: The top 20 gene pairs in the 
metabolic network for lung adenocarcinoma tis¬ 
sue case sorted with respect to Ricci curvature. 


curvature) for the case of breast cancer. This can be seen in Table IV Most importantly, we should note 
that by ranking such pairs via differences in curvature, we are able to “uncover” specific cancer related 
genes. Eor example, the gene LPO (Lactoperoxidase) has been known to contribute to the initiation of 
breast cancer ITTTI . SODS has been considered an important gene in the defense against oxidative stress 
and prevention of estrogen-mediated breast cancer [|431 . PPAP2C has been described as anti-cancer drug 
target [fT4l . GOT2 has been noted to significantly affect cell growth of pancreatic ductal adenocarcinoma 
and over-expression of LRAT has lead to a poor prognoses in colorectal cancer. Interestingly, the 


reoccurrence of genes ACOl and AC02 can also be seen in Table IV One possible explanation can 
be attributed to the fact that these genes regulate iron storage in a given cellular environment, which 
has recently been highlighted to be connected to increase risk for cancer (in terms of cell proliferation 
and growth) [fSTll . Intuitively, and as argued by li59l . metastasizing cancer exhibits a greater degree of 
robustness than benign tumors, which in turn, could explain the reasoning of why ACOl and AC02 are 
a common occurrence. 


B.3 Metabolic Networks: Top 20 Gene Pairs of Lung Adenocarcinoma 

In this section, we again present the top twenty gene-to-gene pairs with respect to Ricci curvature, but 
for the case of lung adenocarcinoma. This can be seen in Table |Vj In addition to the biomarkers for lung 
cancer such as PODS it is interesting to note the reoccurrence of the gene UROD (Uroporphyrinogen 
Decarboxylase). It has been documented that this gene can be targeted to be “knocked out” for a variety 
of cancers and is a driving force behind several drug therapy methodologies [|23l, [|5^ . 

BA Metabolic Networks: Comparison of Entropy and Curvature 

In the previous sections, we utilized the fact that curvature is positively correlated with that of entropy 
to quantify the robustness of a network. While these two metrics may be used, the discriminatory power 
of each measure may differ. In this section, we provide a comparison of entropy with that of scalar and 
Ricci curvature (and hence repeat some results for the sake of clarity). Utilizing equation ( [TT] ), we are 
able to compute entropy as a nodal measure and fairly compare that to that of scalar curvature (also a 
nodal measure). Eor completeness, we repeat the average difference in Ricci curvature. These results are 
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provided in Table VI 


Cancer Type 

A Entropy 

A Scalar Curvature 

A Ricci Curvature 

Breast Cancer 

0.0146 

0.4284 

0.0234 

Head / Neck Squamous Cell Carcinoma 

0.0152 

0.3928 

0.0216 

Kidney Renal Clear Cell Carcinoma 

0.0168 

0.4627 

0.0252 

Kidney Papillary Carcinoma 

0.0189 

0.5455 

0.0300 

Liver Cancer 

0.0030 

0.1088 

0.0060 

Lung Adenocarcinoma 

0.0049 

0.2059 

0.0113 

Prostate Cancer 

0.0110 

0.2896 

0.0159 

Thyroid Cancer 

0.0031 

0.0829 

0.0046 


TABLE VI: Comparison of Entropy and Different Notions of Curvature. 


We again note that from a computationally comparison standpoint, the proposed usage of a curvature can 
be simply implemented as a highly parallelizable linear program compared to that of li59l . Thus, for paths 
that are larger than one, our computation remains unaffected. Moreover, entropy defined in equation ( [T7| ) 
requires a minimum of two incidences per node in order for the measure to be well-behaved. Utilizing 
Ricci (or scalar) curvature, we do not impose such restrictions. This is particularly important, as many 
networks possess a degree distribution that follows the power law [HI. 


C. Transcriptional Networks 


The transcription network data consists of two of the above cancer cases with investigation of 450 cancer- 
related genes and their interactions. An important note, unlike that of the metabolic data, we focus on the 
inter-relationship of cancer related genes (e.g., BRCAl, TP53 are present). 


C.l Transcriptional Networks: Curvature as a Measure for Robustness 

Similar to that of B.l and for the sake of brevity, this section presents the average difference (cancer- 
normal) in scalar curvature of two transcription networks provided by MSKCC. In Table |VII[ we again 


note that scalar curvature is a clear indicator of robustness and while not shown below, average Ricci 
curvature also yields similar results. 


Cancer Type 

A Scalar Curvature 

Cancer Tissue 

Normal Tissue 

Breast Cancer 

0.4292 

8.2037 

7.7744 

Kidney Renal Clear Cell Carcinoma 

0.0728 

8.1460 

8.7033 


TABEE VII: Tabulated results for the two transcription networks with respect to average scalar curvature. 
Cancer can be seen to more robust than its complementary normal network in each of the two cases. 


C.l Transcriptional Networks: Top and Bottom 10 Pairs 

It is of particular interest to examine a network at the local level to illuminate certain attributes of cancer. 
In this section, we present the top and bottom ten gene-to-gene pairs of the possible interactions (as sorted 
by difference in Ollivier-Ricci curvature) for the transcription network cases of breast cancer and kidney 
renal clear cell carcinoma. This can be seen in Table jVIIIi Table ||^ Table and Table 


Erom the aforementioned tables, a few insights are noted. Eirstly and in the particular case of breast cancer, 
RNE43 and TP53 are known tumor suppressor genes and ranked in the top and last gene pairs (out of 

one can see several genes related to cell 


a lOOK possible pairs). In fact, from Table |VIII| and Table] 
proliferation, division, and growth (e.g., GPC3, POTl, RNP43, TP53). With respect to kidney renal clear 
cell carcinoma, it is interesting to note the reoccurrence of PAX8 and its role in adult kidney and renal 
tumors. Such large differences relating to an increase/decrease in robustness for such genes require further 
analysis and provides an exciting avenue into understanding the overall robustness of cancer networks. 
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Gene 

Ranking 

A Ricci Curvature 
(Cancer-Normal) 

Gene X 
(Symbol) 

Gene Y 
(Symbol) 

1 

0.3504 

RNF43 

RSP03 

2 

0.3444 

RNF43 

RSP02 

3 

0.3012 

ERG 

ETVl 

4 

0.3001 

GPC3 

PTCHl 

5 

0.2901 

GPC3 

SDC4 

6 

0.2796 

POTl 

SBDS 

7 

0.2538 

FGFR2 

KDR 

8* 

0.2510 

ETVl 

FEV 

9 

0.2460 

ERG 

FOXAl 

10 

0.2410 

EXTl 

SDC4 


TABLE VIII: Top 10 Pairs in Breast Cancer. 


Gene 

Ranking 

A Ricci Curvature 
(Cancer-Normal) 

Gene X 
(Symbol) 

Gene Y 
(Symbol) 

99226* 

-0.2203 

POTl 

RNF43 

99227 

-0.2214 

ELF4 

FEV 

99228* 

-0.2304 

MUCl 

RNE43 

99229 

-0.2489 

IDHl 

SBDS 

99230* 

-0.2496 

ERCC2 

RNE43 

99231* 

-0.2839 

PRDMl 

RNE43 

99232 

-0.2851 

CCND3 

ETVl 

99233 

-0.3632 

RNF43 

SFPQ 

99234 

-0.3636 

NONO 

RNF43 

99235 

-0.3651 

RNF43 

TP53 


TABLE X: Bottom 10 Pairs in Breast Cancer. 


Gene 

Ranking 

A Ricci Curvature 
(Cancer-Normal) 

Gene X 
(Symbol) 

Gene Y 
(Symbol) 

1 

0.2370 

LMOl 

LYLl 

2 

0.2300 

LMOl 

TALI 

3 

0.2204 

MAE 

NEATC2 

4 

0.1837 

DDXIO 

NUP214 

5* 

0.1827 

DDXIO 

ELN 

6 

0.1688 

ERBB2 

PDGFRB 

7 

0.1681 

IDHl 

SBDS 

8 

0.1661 

ELT3 

ROSl 

9 

0.1613 

NTRK3 

ROSl 

10 

0.1601 

IL6ST 

MITE 


TABLE IX: Top 10 Pairs in Kidney Cancer. 


Gene 

Ranking 

A Ricci Curvature 
(Cancer-Normal) 

Gene X 
(Symbol) 

Gene Y 
(Symbol) 

99226* 

-0.1730 

KDM5A 

PAX8 

99227* 

-0.1731 

BCLllA 

PAX8 

99228* 

-0.1766 

PAX8 

ZNP521 

99229* 

-0.1821 

PAX8 

PWWP2A 

99230 

-0.1857 

POTl 

SBDS 

99231 

-0.1878 

GPC3 

PTCHl 

99232 

-0.1900 

CREB3L2 

GPHN 

99233 

-0.1903 

DDXIO 

MYCLl 

99234 

-0.1935 

ELP4 

EEV 

99235 

-0.1942 

CD74 

MYC 


TABLE XI: Bottom 10 Pairs in Kidney Cancer. 


Additionally, one more important observation regarding the above computation is the uncovering of 
“hidden” pairs (denoted by * in gene ranking). That is, we compute curvature here on all possible 
pair/pathways as opposed to pairs only defined by the underlying biological adjacency matrix and results 
given above. Examining all possible pairs (in a feasible linear program computation) further illuminates 
possible important inter-relationships in a given network (e.g., interactions with PAX8). 


B.4 Transcriptional Networks: Comparison of Entropy and Curvature 

Similar to that of B.4 and for the sake of brevity, this section presents a comparison of entropy with that 
of curvature for the two transcription networks provided by MSKCC. In Table |XII[ we compare scalar 
curvature and entropy as measures of robustness. 


Cancer Type 

A Entropy 

A Scalar Curvature 

A Ricci Curvature 

Breast Cancer 

0.0117 

0.4292 

0.0117 

Kidney Renal Clear Cell Carcinoma 

0.0060 

0.0728 

0.0020 


TABLE XII: Tabulated results for the two transcription networks with respect to average scalar curvature. 
Cancer can be seen to more robust than its complementary normal network in each of the two cases. 


V. Conclusion and Euture Research 

In this work, we have considered using the geometric notion of curvature in order to study the functional 
robustness of cancer networks. There are some interesting related ideas that we will be considering in 
future work that may also have some bearing in network changes in topology. 

Eirst, in some very interesting work, Billera et al. describe a metric geometry on the space of trees in 
connection with phylogenetics. It turns out that their space is a moduli space (universal parameter space) 
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and has non-positive curvature. From the results of [|47l . this allows one to do statistics on this space 
since between any two points there is a unique geodesic. This has had a number of intriguing applications 
including to cancer research [|M1l . It would be very interesting to generalize this to more general network 
structures, and instead of just looking at the geometric (curvature) property of an individual network to 
devise quantitative statistical methods based on the metric geometry comparing families of networks. 

The work of m is largely motivated by the problem of cancer cell heterogeneity. Indeed, cancer 
progression is believed to follow Darwinian evolutionary pattern: fitter subtypes replace other less fit cells, 
which leads to disease. In combination with high-throughput genomics one can construct trees to study 
this process. This is an example of a deep relationship between the concepts of Darwinian evolution and 
Boltzmann thermodynamics advocated by Demetrius flU]. The idea is that macroscopic entropy increases 
under microscopic molecular collisions, while macroscopic evolution can be (partially) explained via the 
concept of the increase of entropy. This reasoning is very much in line with the overall thrust of the 
present paper in which we are trying to use curvature (positively correlated with robustness) to quantify 
network robustness. The macroscopic theory is also very much in line with Boltzmann thermodynamics. 
Evolutionary changes and network adaptability are key topics to be considered in future research. 
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